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FOREWORD 



Helping school board members to lead efforts to raise student achievement 
in their school districts is a priority of the National School Boards Association. 
In keeping with this effort to promote excellence in education, the Council of 
School Attorneys presents Student Testing and Assessment: Answering the Legal 
Questions. Intended to help school attorneys understand the legal issues that 
have emerged over student testing as a component of the current reforms, this 
book first provides some context to that discussion by briefly looking at the 
some of the testing programs being used across the nation and examining some 
pertinent federal developments. An overall framework for evaluating the legal 
integrity of testing policies is also set forth. The following chapters analyze the 
key legal issues surrounding testing used for high-stakes purposes including 
discrimination, due process, testing of students with disabilities and English 
language learners, and the use of student test scores to evaluate educational 
professionals and institutions. The appendices include excerpts from several 
federal documents concerning student testing, some recommendations from a 
national study of high-stakes testing, state statutory references on standards and 
assessments and a list of other useful resources on this subject. 

While these articles strive to provide accurate legal information, they are not 
intended to be legal advice. When questions arise concerning policies or prac- 
tices with respect to student testing, a qualified professional should be con- 
sulted. 

I wish to thank the school attorneys who contributed their time and exper- 
tise to this publication. I also recognize the efforts of all the NSBA staff mem- 
bers who prepared these articles for publication. 



Martin Semple 
Chairman 
Council of School Attorneys 
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An Overview of Student Testing and Assessment 

Naomi E. Gittins 

National School Boards Association 
Alexandria, Virginia 



INTRODUCTION 

Ensuring that students achieve to high 
standards is the mission of public education. 
How can we determine whether our schools 
are successfully meeting this goal and that 
student academic achievement is improving? 
Using a time honored tradition, many states 
have decided that testing students, using 
standardized examinations, to see how well 
they do is one of the assessment tools that 
makes the most sense. But what sounds like 
a simple educational proposition has turned 
out to be anything but that. The problems 
and controversies about student testing 
occur at many levels besides the educational, 
which by themselves are difficult enough. 
Overlaying the educational complexities that 
surround testing are other issues staggering 
in magnitude — political, social, economic, 
and legal. This book examines only the legal 
issues that should be considered in setting 
policy on the appropriate uses and conse- 
quences of student testing. It also explores 
the legal controversies that have already 
arisen or that could emerge when those 
policies are actually implemented. To help 
give context to the extended discussion in 
the following chapters, here is a short 
overview of some of the state and federal 
legislative and agency activities with legal 
implications for high-stakes student testing. 

A few brief state profiles are included as 
examples. 



STATE ACTIVITY 

Legislative and Regulatory Trends 

• Standards and Assessments 1 

Virtually all states have adopted minimum 
“standards” that outline what students need to 
know in certain academic areas. As of 1999 
more than 40 states were using student assess- 
ments (including norm-referenced tests, 
criterion-referenced tests, performance assess- 
ments and portfolios) to measure student 
achievement against these standards and to 
hold students, teachers, schools and districts 
accountable. (See Appendix F for state-by- 
state statutory references on standards and 
assessments.) To measure gains in achieve- 
ment, scores from tests are used separately or 
in combination with other factors. Some of 
these accountability systems compare the 
current year’s scores with previous year(s) in 
reference to a state or national standard. 
Comparisons may be made between classes at 
a certain grade level, among schools within a 
district or among different school districts. 
Group comparison is used more often than 
individual tracking because of the difficulties 
in measuring individual progress where stu- 
dent populations are mobile. 2 

1. This discussion of standards and assessment is based 
on information contained in Judith K. Mather, 
Performance-based Accountability Systems (Education 
Commission of the States, 1999). 

2. Only four states mandate collection of data on 
student mobility: Alaska, Colorado, Illinois and 
Nevada. Id. 
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STATES WITH LEGISLATIVE AUTHORITY 


STATES WITH LEGISLATIVE AUTHORITY 


TO REMOVE PRINCIPAL OF SCHOOL 


TO RECONSTITUTE, TAKE OVER OR 






CLOSE SCHOOLS 




Alabama 


Michigan 


Alabama 


Michigan 


Oklahoma 


Delaware 


Nevada 


Illinois 


Nevada 


Rhode Island 


Illinois 


New York 


Indiana 


New Mexico 


South Carolina 


Kansas 


North Carolina 


Kansas 


New York 


Texas 


Louisiana 


South Carolina 


Louisiana 


North Carolina 


Vermont 






Maryland 






Source: Education Commission of the States, State Education Leader , Vol. 18, No. 1 


(Winter 2000). Reprinted with permission 


irom ECS, 707 17th St., Denver, CO 80202-3427. © 2000 ECS. 









• District or School Rewards for Student 
Performance 3 

By 1999, 17 states were providing rewards 
to districts and schools based on gains in 
student performance. Most of the rewards are 
monetary, but some states also provide waivers 
from assessment rules and regulations. 4 Ex- 
perts disagree as to whether rewards are effec- 
tive, noting that they can motivate changes, but 
the improvements may be not be enduring. 
Rewards for student performance have also 
raised fairness issues, with many claiming that 
assessment measures such as scores on stan- 
dardized tests are not valid and reliable indica- 
tors of student achievement or the schooFs or 
district’s efforts to provide students with a high 
quality education. Some claim that one time 
monetary rewards are also limited in value 
because the appropriation of funds must be 
sustained over time if it is to enhance educa- 
tional quality in a real sense. 

• District or School Sanctions for 
Student Performance 5 

A far larger number of states (35) impose 
sanctions when students fail to achieve mini- 



3. Id. 

4. More commonly, waivers are granted not as a reward 
for improving student achievement but rather as a 
form of assistance to allow a school or district 
flexibility in attempting to raise student achievement 
through a plan or program that would be otherwise 
impermissible under current regulations. 

5. This discussion of possible sanctions relies on 

information contained in How States are Responding 

to Low-Performing Schools , State Education Leader, 

Vol. 18, No. 1, at 13 (Education Commission of the 

States, Winter 2000). 



mum standards. 6 Some of the more 
commonly used state sanctions are: letter 
of warning or notification; require develop- 
ment of improvement plan; placement on 
public list of low performing districts/ 
schools; 7 impose implementation of im- 
provement plan; provide additional funds; 
state assistance team or probation manager 
assigned; enrollment options, e.g ., permit- 
ting students in low performing schools to 
transfer; 8 loss of accreditation; reconstitu- 
tion; 9 reorganization (see box), takeover 10 
and school closure. 



6. Sometimes the sanctions or interventions are 
part of a school accreditation system that 
considers other indicators in addition to student 
achievement. These factors may include: 
attendance, drop-out rates, student discipline 
statistics and faculty credentials. 

7. Twenty-one states either currently or plan in 
the near future to issue overall ratings of 
their schools based largely on student perfor- 
mance as measured by student test scores on 
standardized tests and other academic indica- 
tors. Lynn Olson, Worries of a Standards 
u Backlash n Grow , Education Week, April 5, 

2000 . 

8. As of January 1999 seven states (Kentucky, 
Louisiana, Oklahoma, New York, North Carolina, 
Texas and West Virginia) allowed such transfers. 

9. The first reconstitution occurred in San Fran- 
cisco in 1983. Districts in at least six other 
states (Colorado, Illinois, Maryland, New York, 
Ohio, and Texas) have undergone reconstitution 
since then. 

10. Takeovers may be implemented in several 
different ways, e.g., changing a public school to a 
state-run charter school or shifting control to 
other local or state officials. 
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• High-Stakes Consequences for 
Students 11 

At least 27 states and Puerto Rico have 
enacted statutes requiring students to take high 
school exit examinations as a condition of 
graduation (see box). In some of these states 
the requirement does not become effective for 
several years. Administered at varying points 



STATES WITH HIGH SCHOOL GRADUA- 
TION EXIT EXAMINATIONS* 


Alabama 


Maryland 


Ohio 


Alaska 


Massachusetts 


Puerto Rico 


Arizona 


Minnesota 


South Carolina 


California 


Mississipi 


Tennessee 


Delaware 


Nevada 


Texas 


Florida 


New Jersey 


Utah 


Georgia 


New Mexico 


Virginia 


Hawaii 


New York 


Washington 


Indiana 

Louisiana 


North Carolina 


Wisconsin 


*As of July 31, 


1999 




Source: National Governor’s Association, as reprinted in 
Education Commission of the States, State Education leader, 


Vol. 18, No. 1, 


at 11 (Winter 2000) 


. Reprinted with 


permission from ECS, 707 17th St., 


Denver, CO 80202-3427. 


© 2000 ECS. 







during the high school years, these tests are 
sometimes used to determine which students 
need remediation to meet the standards. 
Students may be provided several opportuni- 
ties to pass the exams. A few states 12 use tests 
to award endorsed or honors diplomas. At 
least 17 states base promotion and retention 
decisions on state and/or district assessment 
(see box). 

• Teacher Evaluation 

At least five states (Delaware, Kansas, 
Kentucky, Tennessee, Texas) link teacher 
evaluation to student examination scores. 13 
Under Delaware’s new law student scores on 
state and local assessments count for at least 
20% of teacher and administrator perfor- 
mance reviews. 14 Other states are considering 
how to do this, but are finding it to be a 
difficult political issue. Some local schools do 
it to avoid losing state aid. For example, 
Baltimore, Maryland began using test scores to 
evaluate teachers in 1996. Under Baltimore’s 
system teachers are given training if their 
students do not improve after one year and 
are dismissed if student performance does not 



STATES THAT BASE PROMOTION AND RETENTION ON STATE 
AND/OR DISTRICT ASSESSMENT 



Arizona 

Arkansas 

California 

Colorado 



Connecticut 

Delaware 

Florida 

Illinois 



Louisiana 
Michigan 
Mississsippi 
New Mexico 



North Carolina 
Oklahoma 
South Carolina 
Texas 
Wisconsin 



Source: “State Student Promotion/Retention Policies,” ECS Web site, www.ecs.org . as reprinted in Education Commission 
of the States, State Education Leader , Vol. 18, No. 1, at 11 (Winter 2000). Reprinted with permission from ECS, 707 17th 
St., Denver, CO 80202-3427. © 2000 ECS. 



11. This discussion of high-stakes consequences for 

students is based on information contained in tables 
found in Education Commission of the States, State 
Education Leader, Vol. 18, No. 1, at 11 (Winter 2000). 



12. Massachusetts, Michigan, Ohio, New York and 
Tennessee. States Conducting Student Competency 
Testing for High School Graduation (Education 
Commission of the States, August 1999). 

13. Teacher Evaluation (Education Commission of the 
States Information Clearinghouse Sept. 1997); Joetta 
Sack, Del. Ties School Job Reviews to Student Tests, 
Education Week, May 3, 2000. 

14. J. Sack, Del. Ties School Job Reviews, supra, n. 13. 
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improve by the third year. 15 Of course, many 
districts discuss student performance with 
teachers during evaluations, but don’t direcdy 
link the two. 

• Alignment Problem 

State efforts to measure student achieve- 
ment through standardized tests and to hold 
the various stakeholders (students, teachers, 
schools and districts) accountable based on 
test scores have encountered many problems. 
In some states, one of the most troublesome 
from a legal and educational point of view, is 
the problem of non-alignment between stan- 
dards, assessments, rewards and sanctions. 

This happens when states use national stan- 
dardized tests or administer state examinations 
that are not linked to their standards or when 
standards and curriculum don’t match. This 
has resulted in part from the failure to coordi- 
nate the various pieces of legislation (which 
may have been enacted separately over time, 
rather than simultaneously) that mandate the 
content standards, require assessment, grant 
rewards and impose sanctions. 16 Others 
believe that the problem is largely caused by 
districts refusing to relinquish control over 
curriculum. Yet still others say it’s a matter of 
resources — with teachers being required to 
teach new curriculum without the necessary 
support such as appropriate textbooks, instruc- 
tional materials and training. i* This situation 
has caused many to cry foul. Their com- 
plaint, which in some cases has been made 
through legal channels, is that it is basically 
unfair to make decisions about student, 
teacher, school and district performance based 
on tests that cover content that students are 
not being adequately taught. To address these 
concerns monumental efforts must be made to 
readjust a school’s or district’s curriculum and 
to provide adequate resources so that teachers 



15. Teacher Evaluation, supra, n. 13. 

16. J. Mather, Performance-based Accountability 
Systems, supra, n. 1. 

17. Ann Bradley, Union Head Issues Standards Warning, 
Education Week, July 12, 2000. 







can properly prepare students for the materi- 
als on which they will be tested. 

Profiles of Selected State Testing Programs 

• Virginia 18 

In Spring 1998 Virginia began its Stan- 
dards of Learning (SOL) testing program, 
requiring students in the third, fifth, eighth 
grades as well as high school students to take 
state standardized tests in several subject 
areas. Schools are currendy issued “report 
cards” showing their pass rates. Beginning in 
2006-2007 schools must achieve 70% or better 
pass rates in each of the subject areas in 
order to be “accredited.” And starting with 
the class of 2004, high school students must 
pass at least six of the eleven tests in the high 
school SOL battery to graduate. 

The school pass rates emerging from the 
Spring 1999 administration of the tests 
showed that only 6.5% of the state’s schools 
met the standard for accreditation. Students 
likewise have passed the exams at surprisingly 
low rates. In 1998 only 54% of Fairfax 
County students — who have an average SAT 
score well above the national average and 
where 91% pursue post-secondary education — 
passed the battery of tests that will be a 
graduation requirement not too far in the 
future. 

Schools in Virginia, like in many other 
states who have embraced high-stakes testing, 
must also figure out what to do about dispari- 
ties between the test scores of minorities and 
those of white students. At least 66% of 
African American students compared to 30% 
of white students would not have received 
their diplomas in 1999 had the tests been 
used as a graduation requirement that year. 

Because of public concerns about the 
SOL’s fairness, the state asked a team of 
experts to evaluate them. The experts’ report 
released in 1999 found that the tests were 
valid and reliable, but the state has nonethe- 

18. This state profile draws heavily from material 
found in J. Dounay, High-stakes Testing Systems, 

ECS Policy Brief (March 2000). 
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less formed a technical advisory committee to 
review the tests annually and to propose 
recommendations for future change. The 
state board of education has also heard from 
state education organizations that voiced their 
concerns about using the test scores alone to 
determine student promotion and graduation. 
One change the state board of education 
approved in October 1999 with respect to 
school accountability is to allow schools to 
avoid reconstitution while making progress in 
raising student test scores. 

In July 2000 the Virginia Board of Educa- 
tion yielded to some of the demands for 
flexibility in the testing requirements for 
graduation but held fast to the standard that 
students pass tests in at least six content areas. 
Virginia now allows students to satisfy the 
testing criteria through performance on a 
wide range of tests in addition to the SOL 
tests. Among the approved alternative tests 
are examinations used in conjunction with the 
Advanced Placement program, International 
Baccalaureate program, College-Level Examina- 
tion Program, SAT-II, and Test of English as a 
Foreign Language. The board also gave 
additional flexibility to students currently in 
the seventh, eighth and ninth grades by 
allowing them to choose four of the six tests 
they must pass to graduate. It rejected pro- 
posals to delay the effective date of the testing 
requirement. 

• Massachusetts 19 

Massachusetts’s testing program, known as 
the Massachusetts Comprehensive Assessment 
System (MCAS), assesses student performance 
in fourth, eighth and tenth grade in language 
arts, math, science and technology. As in 
Virginia, these state standardized achievement 
tests have created controversy among various 
members of the education community and 
within the public mind as well. So much so 
that there have been student boycotts of the 
tests, establishment of parent protest groups 
and refusals by some teachers to administer 
parts of the tests that they felt were unfair. In 

19. Id. 



June 2000 several school boards registered 
their discontent, signing resolutions calling on 
the state to eliminate the MCAS as a graduation 
requirement. These dissenting boards say that 
they support high standards but do not believe 
that the test should be the sole criterion. 

Student test scores are used as a measure 
for both school and individual student perfor- 
mance. The state board of education rates 
schools on a two-year cycle, looking at overall 
performance against the state standards and 
progress on improvement goals set according 
to the school’s baseline performance. If the 
school has not made satisfactory improvement 
on those goals, it must submit to a review 
panel a report explaining its performance and 
a plan for improvement. The state has 
authority to take over schools that fail to 
meet their plan goals within two years of plan 
approval. Beginning in 2003, high school 
students will also be held accountable for 
their performance on the MCAS. Those who 
do not achieve a rating of proficient or above 
will not be eligible to graduate. Results from 
the 1999 tests show that only one third of the 
tenth grade students scored above the gradua- 
tion cut scores in language arts and barely a 
quarter did so on the mathematics, science 
and technology portions of the test. These 
scores on the state standardized tests are, like 
those in Virginia, inconsistent with the na- 
tional performance ratings of students in 
Massachusetts who do very well in compari- 
sons based on the National Assessment of 
Education Progress (NAEP) tests. In response 
the state board has now set passing scores on 
the MCAS exit examinations at levels some 
consider to be too low, with promises to raise 
them in future. 

Troublesome disparities between the scores 
of white students and those of minority 
students have also emerged in Massachusetts. 
An analysis done by a University of Massachu- 
setts group showed that on the 1998 MCAS 
math test for tenth graders, the overall failure 
rate for students of all races was 52%, but 
that percentage shot up to 83% for Hispanic 
students and to 80% for African American 
students. 
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• Texas 

The high failure rates of minority students 
on a state standardized examination used to 
measure mastery of the state-mandated cur- 
riculum was at the center of a lawsuit claim- 
ing that use of the test as a requirement for 
graduation unfairly discriminates against Texas 
minority students and violates their due 
process rights. The case, GI Forum v. Texas 
Education Agency 20 involved a challenge to the 
Texas Assessment of Academic Skills (TAAS) 
under the due process clause of the Four- 
teenth Amendment and under an implement- 
ing regulation to Title VI of the Civil Rights 
Act of 1964. Ultimately, the district court 
upheld the fairness of the test, first instituted 
in 1990 to replace an earlier state high school 
exit examination. While there was evidence 
that minority students have been, and to some 
extent continue to be, the victims of educa- 
tional inequality, there was not enough evi- 
dence to support a finding that the TAAS test 
was developed, implemented and used to 
impermissibly disadvantage minorities. 

Texas public school students begin taking 
the TAAS tests in the third grade and take 
the exit level examination in the tenth grade. 
Students who do not pass the three parts of 
the test in reading, mathematics and writing, 
have at least seven additional opportunities to 
take and pass the TAAS exam before their 
scheduled graduation date. The Texas Educa- 
tion Agency has set the cut score for the exit 
exam at 70% even though by its own projec- 
tions, setting the passing score at this level 
means much greater percentages of Hispanic 
and African American students will fail the 
test than their white counterparts. In spite of 
these projected disparities, the TEA deter- 
mined that objective measures of mastery 
should be imposed in order to eliminate what 
it perceived as inconsistent and subjective 
teacher evaluations of students. Schools are 
required to provided remediation to those 
students who fail to master any portion of the 
test although there is no state mandated 



20. 87 F. Supp.2d 667 (W.D. Tex. 2000). 







approach to remediation and so such efforts 
vary from district to district. 

Texas uses student performance scores on 
the TAAS exams to hold administrators, 
schools and teachers accountable. Texas is 
one of the few states in which teacher evalua- 
tions are directly linked to student test scores. 
The state disaggregates passing and failing 
scores into racial subgroups so that schools 
and districts are aware of the degree of suc- 
cess or failure of each subgroup. If one 
subgroup fails to meet minimum performance 
standards, a school or district will receive a 
low accountability rating. Under these cir- 
cumstances, it is no surprise that many Texas 
teachers and administrators feel a tremendous 
pressure to take drastic measures to raise their 
students' scores. Cases of cheating on state 
assessments in Houston, Austin and eight 
other Texas districts led to the creation of the 
Public Education Integrity Task Force. 21 

• California 

In 1998 the state initiated a new assess- 
ment and accountability system known as the 
Standardized Testing and Reporting (STAR) 
program that uses the Stanford Achievement 
Test Series for grades 2-11. However, the state 
board of education did not pass state stan- 
dards until 1999, thus requiring some efforts 
to retrofit the testing system to the newly 
established standards. The state has devised 
some augmented items to be used in conjunc- 
tion with the Stanford tests and is currently 
designing the California Assessment of Ap- 
plied Academic Skills. The state will also 
require beginning in 2004 that high school 
students pass an exit exam in language arts 
and mathematics in order to graduate. 

All students in grades 2-11 take the STAR 
tests. This includes students with disabilities 
unless they are specifically exempted by their 
Individual Education Plan (IEP) 22 and English 
language learners (ELL). 2 3 ELL students who 



21. As the use of high-stakes testing has increased, 

charges of cheating by teachers and administrators 
have arisen in other places such as New York City 
and Montgomery County, Maryland. 
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have been in school less than 12 months are 
also tested in their primary language. Stu- 
dents who have been in school more than 12 
months but are still classified as ELL may 
also be given a primary language test. As in 
other states assessment of ELL students is 
highly controversial. 24 The concern is that 
testing students in a language they do not 
understand prevents them from showing what 
they know and can do in content areas such 
as math and science. Some question whether 
primary language testing is the answer since 
some research shows that such testing helps 
only those who have been instructed in their 
native language. California law prohibits 
teaching ELL students in their primary 
language if they have resided in this country 
for more than a year. The quality of educa- 
tion provided to minority students in Califor- 
nia has also come under attack on other 
grounds. The American Civil Liberties Union 
filed a suit in August 1999 alleging that the 
state has failed to provide equal access to 
advanced placement courses in schools where 
the enrollment is predominantly lower-income 
African American and Hispanic students. 



22. According to a report by the National Center for 
Educational Outcomes many states are struggling to 
create alternative assessment systems and guidelines 
to determine which students should be considered 
for alternatives. 

23. English language learners is another name for 
students sometimes described as Limited English 
Proficient (LEP). 

21. For example, in Arizona minority at risk and ELL 
students brought suit in federal district court, 
alleging that the state violated the Equal Education 
Act of 1974, 20 USC 1703(f), by not providing 
adequate funding to carry out its Lau responsibili- 
ties and violated implementing regulations under 
Title VI of the Civil Rights Act of 1964 based on 
the disparate impact that the AIMS (Arizona 
Instrument to Measure Standards) test has on ELL 
students. In Flores v. State of Arizona , No. CIV 92- 
596-T-ACM, (D. Ariz. Jan. 24, 2000). The district 
court ruled in favor of the students on the Lau 
claim but rejected the Tide VI claim, saying that 
the plaintiffs had failed to establish the necessary 
causal link between their disproportionate failure 
on the AIMS test and their race. 



SOME FEDERAL DEVELOPMENTS 

The student testing and assessment phe- 
nomenon that has swept the nation has both 
sparked and been spurred by federal activity 
on testing. 

Title I 

Under Title I of the Improving America’s 
Schools Act of 1994 25 receipt of federal funds 
is conditioned upon the state’s development 
of standards and assessments that meet certain 
criteria, including the use of multiple mea- 
sures, assessments for children with different 
language backgrounds and measures of 
progress. Intended as a means to ensure state 
and district accountability for providing high 
quality education to all students (not to assess 
individual student performance), the statewide 
assessment system must be in place by the 
2000-01 school year. 

All students, including those who have 
limited English proficiency or disabilities, must 
be included in the assessments. The only 
students who may be exempted are those who 
have not attended schools in the local educa- 
tion agency for a full academic year. 26 To the 
extent practicable, ELL students must be 
assessed in the language and form most likely 
to yield valid results. For example, this might 
mean giving the student a bilingual (English 
and native language) version of the test. This 
determination must be made on an individual- 
ized basis but in no case may an ELL student 
be assessed against content or performance 
standards less rigorous or demanding than the 
standards applicable to all other students. 

The assessment systems must also include 
“reasonable adaptations and accommodations 
for students with diverse learning needs, 
necessary to measure the achievement of 
students relative to state content standards.” 
States must develop policies on testing accom- 
modations that include the range of accom- 
modations local officials may use, the types of 

25. 20 U.S.C. § 6301 et seq. 

26. 20 U.S.C. § 6311(b)(3)(F), (G). 
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students and the conditions under which each 
accommodation may be used, instructions on 
the proper use of each accommodation and 
reporting requirements that enable the state 
to track and evaluate the use of accommoda- 
tions. 

With respect to students with disabilities 
whose IEP or Section 504 teams determine 
that the standard state assessment is not 
appropriate, states must have a statewide 
alternate assessment system or a comprehen- 
sive state policy governing locally developed 
alternate assessments. (See Appendix B for 
reprint of memorandum from Michael Cohen, 
Assistant Secretary for the Office of Elemen- 
tary and Secondary Education, U.S. Depart- 
ment of Education, providing summary guid- 
ance on the inclusive requirement of Tide I 
assessments.) 

Individuals with Disabilities Education Act 

Likewise the 1997 Amendments to the 
Individuals with Disabilities Education Act 27 
(IDEA) require states and districts to include 
students with disabilities in their assessment 
programs to the extent appropriate and also 
to report scores. Ultimately, it is a decision for 
the IEP team as to whether a student with a 
disability should be included in broad scale 
tesdng. If the decision is to exempt the 
student, the IEP must state the reasons sup- 
porting such a decision. 

States had until July 2000 to develop 
guidelines for alternative assessment systems 
for students who are deemed unable to take 
the regular tests even with accommodations. 
The federal government has provided only 
vague guidance to the states on how to set up 
such systems and has left decisions about 
eligibility, types of assessment, scoring of tests, 
reporting of scores and alignment of alterna- 
tives with state academic standards to state 
officials. In addition, states must cope with 
development of procedures to implement the 
alternative assessments, training of staff, many 
of whom may be uncertified, and how to 



27. 20 U.S.C. § 1400 et seq. 
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handle students who cannot take the standard 
assessments, even with accommodations, but 
who do not fit the criteria for alternative 
assessments. 28 

For those who do receive accommoda- 
tions, there is concern that too many accom- 
modations to take the exams will make the 
test scores hard to interpret as a measure of 
the students’ actual achievement both on an 
individual and aggregate level. In addition, 
over-accommodation may be detrimental to 
efforts to help such children become inde- 
pendent. (See K. Mehfoud, Special Consider- 
ations in Testing , infra , at 4 -1 for more infor- 
mation about testing students with disabilities 
and English language learners.) 

National Research Council Study 

In 1997 in response to the proposals for 
voluntary national testing and the increasing 
use of large scale testing by states for high- 
stakes purposes, Congress asked the National 
Academy of Sciences through its National 
Research Council for guidance on the appro- 
priate and nondiscriminatory use of such 
tests. 29 The Council’s Board on Testing and 
Assessment issued its report to Congress in 
1999. High-stakes Testing for Tracking, Promotion 
and Graduation (J. Heubert and R. Hauser, 
eds., National Academy Press). Among the 
recommendations contained in the report are 
the following: 

• Accountability for educational outcomes 
should be a shared responsibility of 
states, school districts, public officials, 
educators, parents and students. 

• Those who share this responsibility 
should have access to information about 
the nature and interpretation of tests 
and test scores. 

• Tests may be appropriately used to lead 
to curricular reform but should not be 

28. Joetta Sack, Alternate-Test Plans Prove Challenging 
Education Week, June 21, 2000. 

29. Pub. L. 105-78, Sec. 309. 
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used to make high-stakes decision 
about individual students until test 
users can show that the test measures 
what the students have been taught. 

• Test users should avoid simple either-or 
options for students doing poorly in 
school in favor of strategies combining 
early intervention and effective 
remediation. 

• High-stakes decisions, such as tracking, 
promotion and graduation should not 
be made on the basis of a single test 
score. 

• Students should be provided with 
sufficient test preparation. Adequate 
test preparation for a particular test 
format should be balanced against the 
possibility that excessively narrow 
preparation will invalidate test out- 
comes. 

• High-stakes testing programs should be 
roudnely evaluated. Policymakers 
should monitor the consequences of 
high-stakes assessments on all students 
and on significant subgroups such as 
minorides, English language learners 
and students with disabilides. 

(See Appendix C for more of the Council's 
recommendadons and potendal strategies 
for appropriate test use.) 

Proposed Legislation 

Some of these recommendadons appear 
in a bill introduced in Congress called the 
Fairness and Accuracy in Student Tesdng 
Act. 30 The bill requires that a state or local 
educadon agency receiving funds under the 
Elementary and Secondary Educadon Act of 
1965 meet certain condidons to use stan- 
dardized test performance as part of its 
decision making about retendon, gradua- 
tion, tracking or within-class ability grouping 



of an individual student. Among the condi- 
dons are: 

• test scores cannot be the sole determi- 
nant or be assigned determinative weight 
in making the decision; 

• test must be valid and reliable for the 
purpose for which the results are being 
used; validity and reliability of cut scores 
must be established; 

• test must be based on state or local 
content and performance standards; test 
must be aligned with curriculum and 
classroom instruction; 

• students must be provided multiple 
opportunities to demonstrate proficiency 
in the subject matter tested; 

• test must be administered in accordance 
with written guidance from test developer 
or publisher; 

• accommodations and alternative assess- 
ments must be available for students with 
disabilities; 

• appropriate accommodations must be 
provided to ELL students; and 

• state or local education agency must 
conduct comprehensive evaluation of the 
test's use on students' education and 
educational outcomes; agency must pay 
particular attention to impact on students 
disaggregated by socio-economic status, 
race, ethnicity, limited English proficiency, 
disability and gender. 

These bills have been referred to the 
education committees in both houses. 

OCR Guidance 

In response to numerous requests for 
technical assistance and a rising number of 
discrimination complaints in the area of 
testing and assessment, the Office for Civil 
Rights (OCR) of the U.S. Department of 
Education is developing a resource guide for 



30. H.R. 4333 and S. 2348, 106th Cong., 2d Sess. 
( 2000 ). 
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educators and policymakers on the use of tests 
when making high-stakes decisions for students 
such as graduation, promotion, and placement 
in special programs. Intended as a practical 
document to help in the planning and imple- 
mentation of testing policies, the guide in its 
current draft form contains discussion of test 
measurement principles in the context of 
sound educational policy and practice and 
their relationship to compliance with the 
federal nondiscrimination statutes enforced by 
OCR. The guide’s discussion of test measure- 
ment principles draws heavily from the Stan- 
dards for Educational and Psychological Testing 
(Joint Standards) which were recently revised by 
a joint committee of the American Educational 
Research Association, the American Psychologi- 
cal Association and the National Council on 
Measurement. The chapter on legal issues 
emphasizes the nondiscrimination require- 
ments under Title VI of the Civil Rights Act of 
1964 (race and national origin), Section 504 
of the Rehabilitation Act of 1974 (disability), 
Title II of the Americans with Disabilities Act 
(disability) and Title IX of the Education 
Amendments of 1972 (gender). The analysis is 
divided between different treatment and 
disparate impact. There is a separate discus- 
sion of special issues related to students with 
disabilities and a brief examination of some of 
the constitutional principles that apply. The 
guide also includes a glossary of legal terms, a 
compendium of legal concepts that are rel- 
evant to student testing programs and a list of 
other resources and references. Currently in 
draft form, 31 the final document is expected to 
be published in September 2000. 



31. The draft is currently available at http://www.ed. 
gov/ offices/OCR/ testing 



CONCLUSION 

As states have raised the stakes associated 
with student performance on standardized 
tests, controversies have emerged that have no 
easy solutions. Many believe that the current 
testing phenomenon reflects our misguided 
obsession with measuring the outcomes of our 
education system (student performance) 
without first dedicating the resources, in terms 
of time, effort and money, for students to 
have any real chance of achieving the stan- 
dards we set for them. Others say that setting 
standards and attaching high-stakes conse- 
quences to meeting those standards will drive 
reforms within public education that will 
ensure that students do achieve academic 
proficiency. 

Despite these controversies, one thing is 
clear — the current emphasis on standards and 
high-stakes testing is unlikely to disappear any 
time soon. This makes it imperative that 
school leaders make sound educational deci- 
sions about test use consistent with the re- 
quirements of the law. 
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A Framework for Addressing Student 
Test Use Issues* 

Arthur L. Coleman 
Nixon Peabody, LLP 
Washington, D.C. 



INTRODUCTION 

In the last decade, accountability became 
the driving force behind public education 
reforms. Echoes of “are my child’s teachers 
qualified?,” “does my child’s school measure 
up to others?” and “does my child know 
enough?” continue to reverberate in conver- 
sations about the performance of our schools 
nationwide. Few educators dispute the impor- 
tance of accountability in education, but 
many vigorously disagree about how to 
ensure accountability for students, teachers, 
and schools — especially when the subject of 
test use arises. 

Education reforms that rely on high- 
stakes tests to measure student achievement 
have increased the focus on the legal param- 
eters and educational principles that should 
guide decisions about student promotion and 
high school graduation. This article will 
describe the central legal and educational 
principles that should guide the development 
and implementation of tests used in these 



An earlier version of this article appeared in Inquiry 
& Analysis (NSBA Council of School Attorneys, Mar. 
1999). Segments of this article are taken from: 
Coleman, Excellence and Equity in Education: High 
Standards for High-Stakes Tests , 6 Va. J. Social Pol. 8c 
L. 81 (Fall 1998), which provides more extensive and 
detailed coverage of the issue. Citations to the legal, 
educational and psychometric principles set forth 
herein can be found in that article. Segments used 
with permission. Readers interested in this issue also 
should review: National Research Council, High 
Stakes: Testing for Tracking, Promotion, and 
Graduation. (J. Heubert & R Hauser, eds., 1999). 

See Appendix D for other related resources. 



critical, high-stakes decisions. Ultimately, the 
lessons to be derived from federal jurispru- 
dence, educational research and practice, and 
psychometric principles are as simple as they 
are important. They are: 

• The determination of whether a test is 
valid cannot be made without knowing 
the educational objective and purpose of 
the test. A test may be valid for one 
situation or use and invalid for others; 
likewise it may be valid when measuring 
the performance of some test-takers and 
not others. 

• Any test with high-stakes consequences 
for students must be valid for its particu- 
lar use. This means tests designed to 
measure learning and used to determine 
high school graduation or advancement 
from grade to grade must fairly measure 
what the student has been taught. As a 
general proposition, there must be 
alignment among the curriculum, in- 
struction and material covered on such 

a high-stakes assessment. Issues related 
to accommodations or alternative testing 
for students who are not fully proficient 
in English and students with disabilities 
must also be examined with care to 
ensure that the testing of these students 
corresponds to their particular educa- 
tional programs. 

• Test results can provide very valuable 
guidance in deciding whether to pro- 
mote a student. However, a test is only 
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one tool among many. Many factors — 
some relating to what a test measures, 
and some not — may affect a student’s 
performance on a particular test. Not 
surprisingly, therefore, psychometric 
standards confirm that a single test score 
should not be used as the sole criterion 
to make high-stakes educational deci- 
sions, such as high school graduation or 
promotion from grade to grade. 

• The existence of statistically significant 
disparities in test scores among discrete 
student populations ( e.g minority-non- 
minority; urban-rural) should cause 
further inquiry and examination regard- 
ing the relevant educational program 
and associated testing practices. * Group 
differences in test scores may reflect a 
range of causal factors unrelated to the 
test, such as lack of preparation for the 
test, or they may reflect a problem with 
the test itself. Strategic and systematic 
monitoring of results over time can help 
prevent (and cure) problems of test bias, 
other discriminatory practices, or pro- 
grammatic deficiencies affecting student 
learning. 

EDUCATIONAL RIGHTS AND RESPON- 
SIBILITIES: SOME FOUNDATIONS 

Several fundamental principles underlie 
the framework of questions policy makers and 
educators should consider to ensure testing 
practices appropriately influence educational 
decisions about student advancement. These 
guiding principles have: (1) influenced an 
array of federal court decisions since the 
1954 decision in Brown v. Board of Education] 1 
(2) shaped federal educational policy since at 
least 1993; and (3) informed the policy 
choices in numerous states and locales that 
have grappled with standards-based reform 
issues. 



1. 347 U.S. 483 (1954). 







Principle One: The basic obligation of public 
educators is to teach students, with their 
different backgrounds and abilities, the 
knowledge and skills they need to progress 
academically and mature with meaningful 
expectations of a productive life . 

Simply put, the job of the public educator 
is to prepare students to be successful mem- 
bers of society. This educational mandate is 
no less present during classroom instruction 
than it is when educators administer, evaluate 
and act on student test results. Tests, in short, 
should be instruments used by educators to 
accomplish this goal. Thus, policy makers and 
the education community must work to guar- 
antee that the establishment of high standards 
for all students does not unfairly deny educa- 
tional opportunity to any one student. 

Principle Two: The goals of guaranteeing 
excellence by promoting high academic stan- 
dards and ensuring that all students have 
fair opportunities to achieve success in public 
education are inseparable, mutually depen- 
dent goals. 

The goals of excellence through high 
academic standards and the opportunity for 
students to achieve success in public education 
are corresponding, coextensive goals — not, as 
some maintain, irreconcilable objectives. 

Policy makers and the education community 
should approach the issues regarding the use 
of high-stakes tests with a commitment to 
ensure that all students can learn in an envi- 
ronment in which standards are high and in 
which “all” means “all.” They must appropri- 
ately address the right questions when design- 
ing and implementing test-related reforms so 
that all students receive the opportunity to 
learn to standards that reflect the curriculum 
and instruction, and that are measured fairly. 

Principle Three: If it doesn’t make educa- 
tional sense, it can’t make legal sense under 
federal civil rights laws. 

The legal standards that guide the enforce- 
ment and adjudication of federal civil rights 
laws related to student access to education are 
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ultimately premised on educational judg- 
ments — not some abstract legal theory. For 
instance, in claims of race, national origin or 
sex discrimination in education, the disposi- 
tive questions posed by the courts center 
upon the adversity to a student’s (or group 
of students’) educational opportunity. The 
courts focus on the foundations for the 
educational decision-making leading to the 
challenged practice. 

FEDERAL LEGAL STANDARDS AND 
INQUIRIES 

The parameters that should guide educa- 
tors in the design and use of high-stakes tests 
can be found in an array of federal cases 
raising race discrimination and due process 
challenges in the elementary and secondary 
context. The legal inquiry under a typical 
(non-intentional) race discrimination chal- 
lenge is: 

• Does the use of the test cause an ad- 
verse racial impact: does the use of the 
test cause a denial of educational oppor- 
tunities to students of one race in 
numbers significandy disproportionate to 
the numbers eligible for the educational 
opportunity being denied — when com- 
pared to students of other races? and 

• Is the use of the test educationally 
necessary: is the test valid for the par- 
ticular purpose used and, if so, are 
there feasible, less discriminatory alterna- 
tives that would as effectively serve the 
educational objectives identified by the 
school? 

The federal due process inquiry typically 
centers on the sufficiency of the notice of 
the testing requirement and the opportunity 
that students have had to learn the material 
tested. The legal inquiry is: 

• Are students denied educational benefits 
or opportunities to which they have a 
legitimate claim, based on their test 
scores? In legal terms, is there a denial 



of a property right or liberty interest? 
and 

• Have the students been provided with a 
fair opportunity to pass the test? More 
particularly, is the test valid for the 
purpose used? 

Whether it is a claim of discrimination or 
a broader claim of unfairness in the test 
design or in test-related practices, the ultimate 
question upon which the typical federal legal 
analysis hinges is one of educational suffi- 
ciency: is the test valid for the purposes used 
for all students taking the test? The questions 
in each instance are, in simple terms: Does 
the test do what it is intended to do? Are the 
inferences derived from test scores and the 
educational judgments based on those infer- 
ences appropriate? 

The federal courts have also affirmed 
some psychometric principles that demonstrate 
the highly complex, contextual and fact- 
intensive nature of the inquiry regarding the 
validity of the use of test results. Accordingly, 
no one-size-fits-all formula can guide decision 
making regarding test use — particularly with 
regard to high-stakes decision making, about 
how to link test results to student promo- 
tion from grade to grade. However, prin- 
ciples derived from the federal cases dealing 
with the issue of standards, equity and high- 
stakes test use can be distilled into questions 
that should be thoroughly examined and 
addressed when schools consider high-stakes 
testing. 

FRAMEWORK TO GUIDE THE USE OF 
HIGH-STAKES TESTS 

What are the educational objectives or justifi- 
cations that support the use of a particular 
test? 

Before any state or district administers any 
test, the objectives of the testing exercise 
should be clear: what are the goals for and 
uses of the test? The answer to this question 
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will guide all other relevant inquiries about 
whether the test is educationally appropriate. 
Judgments about a test’s validity are impos- 
sible, and the conclusions one may draw from 
the results are meaningless, absent clear test 
use objectives. For example, one cannot 
conclude that a particular test of ninth grade 
science proficiency is, as a general proposi- 
tion, a valid test. A test of student achieve- 
ment and learning fully aligned with District 
A’s ninth grade science curriculum and in- 
struction and validated according to profes- 
sional standards for such use may be an 
appropriate consideration when making pro- 
motion decisions about students taking the 
test. Conversely, that same test, if used as a 
tool to make special education decisions or 
used to determine college admissions, would 
neither pass legal nor psychometric scrutiny. 

In fact, the test would not necessarily be 
appropriate to use with District B to measure 
ninth grade science proficiency absent a 
determination that material tested was fairly 
representative of the material taught in Dis- 
trict B’s ninth grade science courses. 

Not surprisingly, compliance with federal 
legal standards related to due process and 
discrimination protections rests, in the first 
instance, upon the educational judgment 
about the purpose and use of the test. Cen- 
tral to this inquiry is the issue of whether 
high-stakes consequences attach to the test 
results at all. Is the test designed to monitor 
a student’s or school’s performance and to 
inform educators over time about the need 
for additional resources? Or, is the test de- 
signed as a gatekeeper, so that failure has 
particular and real consequences for the 
student taking the test? Absent a denial of 
educational opportunity to particular students, 
federal constitutional or statutory protections 
designed to ensure fair and equitable opportu- 
nities for students are less likely to be trig- 
gered. Even if a test is not high-stakes, civil 
rights protections may apply where the oppor- 
tunity to take the test is itself an educational 
benefit, and that opportunity is denied on the 
basis of race, national origin, gender or 
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disability. If the test use in question has 
high-stakes consequences, then the ultimate 
legal inquiry parallels the ultimate educa- 
tional question: Do the test results at issue 
allow for informed and consequential deci- 
sions to be made about students taking the 
test? In other words, are the conclusions 
derived from the test results valid? 

Is there alignment between the material 
tested and the curriculum and instruction? 
Have the students who must achieve to a 
schooVs academic standards been provided 
the necessary instruction to give them a fair 
opportunity to do so? 

As instruments designed to inform stu- 
dents, parents and educators about success in 
meeting educational goals, high-stakes tests 
should be developed and administered as 
part of the educational effort to improve 
student performance; they should be inte- 
grated into the overall educational program. 
When tests (as measures of outcomes) are 
aligned with the curriculum and instruction 
(the educational inputs), they will more 
likely be measures of classroom learning. 
When the content of a high-stakes test 
reflects actual classroom instruction and the 
curriculum of an educational system, appro- 
priate test use can help achieve educational 
excellence for all students. In addition, 
when tests provide a fair basis for making 
decisions regarding student graduation or 
promotion from an educational standpoint, 
the more likely they will pass legal scrutiny. 

In the leading case on the subject, Debra 
P. v. Turlington , 2 the Fifth Circuit Court of 
Appeals ruled that a state proposing testing 
requirements as a condition of high school 
graduation was obligated to determine if the 
material being tested was actually being 
taught. The court found the test “was 
fundamentally unfair in that it may have 
covered matters that were not taught in the 
schools of the state .” 3 Recently, the federal 

2. 644 F.2d 397 (5th Cir. 1981). 

3. Id at 403. 
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district court in G.I. Forum v. Texas Education 
Agencf reaffirmed the notion that there must 
be more than theoretical coverage of the 
curriculum on such tests. In that case, the 
court partially based its ruling in favor of the 
State of Texas on the evidence showing that 
“the [challenged] test measures what it pur- 
ports to measure . . . [and] all students in 
Texas have had a reasonable opportunity to 
learn the subject matters covered by the 
exam.” 5 

This alignment issue (which is, ultimately, 
an opportunity to learn issue) is decidedly 
more complex for students who are not fully 
proficient in English or who have disabilities. 
A related yet distinct set of accommodations 
and alternative testing issues surface as high- 
stakes testing includes these students. 

For students who are not fully proficient 
in English (often referred to as English lan- 
guage learners or limited English proficient 
students), educators must ensure that assess- 
ment practices mesh with the instructional 
programs designed to (1) address the stu- 
dents’ language abilities and (2) allow for 
content mastery. In short, the way in which 
these students are assessed for high-stakes 
purposes cannot be divorced from the overall 
theory of the educational program provided: 
symmetry between the educational program 
(whether it is some form of bilingual instruc- 
tion or immersion, for instance) and the 
corresponding testing practices must exist. 

Similarly, disabled students who, under 
federal law, should be educated pursuant to 
an individualized educational program, must 
be assessed in a way that is consistent with 
that program. This means, at a minimum, 
that the high-stakes testing practices (and the 
drive toward inclusion of all students in 
statewide and district-wide programs) cannot 
be imposed without regard to the terms of 
these individualized programs and the particu- 
lar needs of the students individually. See K. 
Mehfoud, Special Considerations in High-Stakes 

4. 87 F. Supp.2d 667 (W.D. Tex 2000). 

Id. at 681-82. 



Testing , infra , at 4-1 for more discussion of 
testing disabled students.) 

When establishing high-stakes conse- 
quences for students, the issue of notice about 
the policy changes that will have significant 
consequences for students is paramount. One 
federal court has observed: “No one could 

seriously contend that academic requirements 
could never be changed during the twelve 
years a child typically spends in school. It is 
also obvious that [high-stakes test] 
requirements] could not . . . be[] constitu- 
tionally imposed a day prior to graduation. 
Such late notice could serve no academic 
purpose.” 6 The timing between the establish- 
ment of educational objectives related to 
student advancement and the actual imposi- 
tion of high-stakes consequences attached to 
student performance (frequendy based on test 
scores) is important. Clearly, there's a rela- 
tionship between the notice provided and the 
kind of opportunity that students have had to 
master the material or skills tested. Once 
again, the central legal (and educational) 
inquiry regarding the students’ opportunity to 
learn drives the analysis. (See R.C. Wood, Due 
Process and Discrimination Issues , infra , at 3-1 for 
more discussion of due process issues.) 

How are new test requirements implemented 
and interpreted to ensure appropriate educa- 
tional decision making? 

A series of protections may be integrated 
into the administration of a test and the 
interpretation of test scores to help eliminate 
the risk of inappropriately denying (or confer- 
ring) educational opportunities to students 
based on test scores. Those steps include: (1) 
establishing compensatory or tutorial supports 
to ensure that all students have the same 
basic and fair opportunity to master the 
material tested; (2) providing multiple oppor- 
tunities to take the test; (3) considering 
academic factors in addition to the test scores 



6. Anderson v. Banks, 520 F. Supp. 472, 506 (S.D. 
Ga. 1981). 
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that may affirm or challenge the high-stakes 
conclusions derived from the test score alone; 
and (4) using appropriate accommodations to 
meet the needs of disabled or limited English 
proficient students. 

These are all highly contextual practices 
that inform in very real terms the ultimate 
fairness of any test instrument. These kinds 
of protections are premised in part on a 
recognition that tests are not perfect barom- 
eters of learning, and similarly, that conclu- 
sions based on those results are not error-free. 
Given the inevitable presence of the human 
element in teaching and in learning, there is 
no guarantee that each student will have 
received all of the instruction necessary to 
provide a fair chance for success on high- 
stakes tests, even where there is alignment 
among standards, curriculum, instruction and 
high-stakes tests. Nor is there any foundation 
upon which one could reasonably conclude 
that all students can demonstrate what they 
have learned and what they know equally well 
on standarized tests. The principles of psycho- 
metrics in education recognize these points. 
Simply, there is no magic to any particular 
test score. Therefore, there should be ample 
protections in place to ensure that students 
are not inappropriately denied meaningful 
educational opportunities as a result of perfor- 
mance during a single test administration. 

What evidence exists regarding the adverse 
impact of the high-stakes decision making 
upon discrete groups of students ? What 
history and continuing effects of racial segre- 
gation or discrimination exist in the state or 
school district in question? 

The perfect test is not, and cannot be, the 
enemy of the good test when evaluating and 
implementing testing practices. Nonetheless, 
psychometrically sound testing practices and 
conclusions must guide the development, 
administration and use of tests designed to 
further high-standards learning. The evalua- 
tion of test results is necessarily a dynamic 
one: test results can change over time based 



on a range of factors that may affect those 
results, such as teaching practices, methods of 
test administration, and student limitations 
and needs. As a result, careful monitoring of 
test inputs and outcomes over time is critical. 
Importandy, educators should regularly moni- 
tor test results to determine if there are 
significant disparities among student groups, 
based on race, national origin, gender or 
disability. Psychometrically and legally, the 
presence of such disparities should lead to 
further investigation so that any potential bias 
or discrimination in the test use can be 
eliminated. 

Similarly, a state’s or district’s history of de 
jure segregation is central in any determina- 
tion regarding the imposition of new high- 
stakes criteria. Such test instruments have the 
potential to perpetuate (rather than elimi- 
nate) continuing effects of past discrimination. 
One court, in this context, has noted that the 
otherwise legitimate aim of improving educa- 
tional standards cannot be viewed in isolation 
when a school district has a de jure history and 
a subsequent policy of student tracking that 
perpetuated the segregation. The point is 
this: If a school system denied full and equal 
opportunity to a particular group of children 
by not providing them a level of instruction 
that ensures a meaningful opportunity to 
succeed, then the efforts to implement high- 
stakes measures with impact on individual 
students must not perpetuate the vestiges of 
that prior discrimination. 

CONCLUSION 

As the cases involving discrimination and 
“unfairness” claims reveal, the educational 
interest is at the heart of the examination of 
whether tests are being used in legally appro- 
priate ways. This legal and educational align- 
ment suggests, therefore, a basic foundation 
for concluding that the promotion of excel- 
lence in education can be and should be fully 
consistent with the promotion of equity and 
fairness in education. In short, it may be 
useful to think about the fundamental point 
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of this discussion in very simple (albeit decep- 
tively simple) terms. One court has noted: 

“An invalid test cannot measure merit.” 

The relative simplicity of the framework 
proposed in this article does not suggest that 
the issues to be addressed are simple or easy. 
The challenges facing educators struggling to 
improve the quality of their schools and 
student performance are many and complex. 
How can we improve the quality of teaching 
to better prepare our students for a new 
century? How can we ensure that schools 
facing some of the most significant challenges 
have the necessary resources to help all 
students achieve to high standards? One step 
in the right direction is to move beyond the 
frequently polarizing rhetoric to address the 
real issues educators face and to identify 
frameworks and strategies that can be pursued 
to accomplish the twin educational aims of 
high standards and fairness. 

High standards for academic achievement, 
when coupled with instruction and support 
that help students reach those standards — as 
determined by valid and reliable assessments — 
can unite students, parents, teachers, adminis- 
trators and community residents around the 
shared goal of improving student performance 
for all. By defining what students should 
know and be able to do, standards keep 
schools focused on the desired results for 
students — and can stimulate local development 
of appropriate curricula and application of 



effective teaching strategies to make these 
results possible and challenge students to 
learn. Standards also indicate what assess- 
ments must measure in order to show achieve- 
ment. 

Although the questions that arise about 
standards and equity will be as different as the 
students served by our public schools, a 
fundamental point should not be lost: all 
students need an educational system that 
expects high performance and that offers real 
and meaningful educational opportunities. As 
U.S. Education Secretary Riley has often said, 
“All students have the right to a high quality, 
high standards education. Nothing less will 
do.” 
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INTRODUCTION 

The trend that began more than 25 years 
ago with the experimental use of perfor- 
mance-based testing in education recently has 
become a tidal wave of reform that is sweep- 
ing the country. The ubiquitous movement 
toward standards and assessment is a product 
of many forces, most powerfully the demand 
for accountability. Many parents, community 
leaders, employers and school boards dissatis- 
fied with student literacy see performance- 
based testing as the most effective way to 
increase both the quality of education and the 
level of student achievement. In response to 
these concerns, state legislatures are under 
increasing pressure from various factions to 
mandate higher standards and to hold educa- 
tors accountable when these standards are not 
met. Legislators are responding by passing 
sweeping educational reform statutes that in 
many cases have significant consequences for 
schools. (See N. Gittins, An Overview of Student 
Testing and Assessment , supra, at 1-1 for a 
summary of state and federal activity on 
student testing.) 

High-stakes tests have aroused emotional 
debate in recent years, but more importantly 
they implicate a multitude of legal issues. 
While case law on the issue is limited, there is 
a core body of law that is useful in predicting 
legal outcomes. Because some states (particu- 
larly Florida and Texas) were ahead of the 
pack in tying accountability and consequences 
to assessment, their experiences offer insight 
into how courts likely will approach the issues. 



In the realm of educational accountability, 
the higher the stakes, the greater the risk of 
legal action. In districts where core knowl- 
edge is measured, but not tied to any specific 
consequences, there is litde legal impact. 

When students who complete required courses 
are nevertheless denied high school diplomas 
because of low test scores, and teachers are 
given unsatisfactory evaluations based on those 
scores, the landscape becomes ripe for litiga- 
tion. 

The most prevalent issues arising out of 
the use of high-stakes testing include racial 
and sex discrimination, due process concerns, 
fundamental fairness and test validity, testing 
of disabled students, adverse employment 
action for teachers and issues of collective 
bargaining. While some of these issues have 
yet to be addressed by the courts, sufficient 
legal precedent exists upon which to draw 
some general conclusions. 

LEGAL BACKGROUND 

States 9 Rights and Responsibilities 

States have undisputed power to educate 
their citizens. The Tenth Amendment to the 
Constitution reserves powers not expressly 
delegated to the federal government to the 
states and their citizens, and all states have 
exercised this authority by creating and main- 
taining systems of free public education. State 
constitutions typically provide plenary power 
over education. 
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The Supreme Court has been reluctant to 
interfere with the states’ rights to oversee 
education and usually will do so only when 
“necessary to protect freedoms and privileges 
guaranteed by the United States Constitution.” 1 
In applying rational basis scrutiny to decisions 
about education, the Court has stated that “the 
education of the people in schools. . .is a 
matter belonging to the respective States, and 
any interference on the part of Federal author- 
ity with the management of such schools 
cannot be justified except in the case of clear 
and unmistakable disregard of the rights se- 
cured by the supreme law of the land.” 2 

Because states have a crucial interest in 
maintaining the quality of education for their 
citizens, they may impose the standards they 
deem necessary, as long as the standards are 
not arbitrary or capricious. So long as an 
action does not violate the U.S. Constitution, a 
state “may determine the length, matter and 
content of any education it provides,” including 
the use of high-stakes testing. 3 

LEGAL IMPLICATIONS OF HIGH- 
STAKES TESTING 

Equal Protection 

Testing linked to adverse consequences 
inevitably increases the potential for discrimina- 
tion claims by affected student groups. Statis- 
tics show female and minority students rou- 
tinely score lower on standardized tests than do 
white males. Adveresly affected students some- 
times argue that disparities in test scores result 
from illegal discrimination. Legal claims may 
be brought both under the Equal Protection 
Clause and under Title VI of the Civil Rights 
Act of 1964. Title VI in particular requires 
that educational institutions “take steps to 
insure that test materials. . .are selected and 
administered in a manner which is nondiscrimi- 
natory in its impact.” 4 

1. Debra P. v. Turlington, 644 F.2d 397, 402 (5th 

Cir. 1981). 

2. Id. (citing Cumming v. Board of Educ., 175 U.S. 

528 (1899)). 

3. Id. 

4. 42 U.S.C. § 2000d et seq. (1996). 



It is difficult to show discriminatory intent 
in legislation requiring minimum competency 
standards in education. However, experience 
has demonstrated that discriminatory impact is 
often an unfortunate result of such tests. One 
recent study, commissioned by the Harvard 
Civil Rights Project, charged that high-stakes 
tests in some states actually detracted from 
learning, especially for minority and low 
income students. 5 

In Texas, a lawsuit challenging that state’s 
testing program filed by the Mexican Ameri- 
can Legal Defense and Education Fund in 
1997 alleged that “Hispanic and African 
American students concentrated in under- 
funded school districts have not been afforded 
an equal opportunity to learn the material 
covered on the graduation exam,” 6 The 
lawsuit alleged that “minority students have 
failed the test and been denied diplomas in 
disproportionate numbers.” 7 However, in early 
2000 the federal judge hearing the case re- 
jected the charges that the Texas test is unfair 
to black and Hispanic students, including 
those in poor communities, and dismissed the 
case. 8 Similar cases are pending in North 
Carolina and Nevada where plaintiffs allege 
discrimination based on race and national 
origin. 9 As in other circumstances, courts will 
look to motive, justification, and the impact of 
past discrimination to assess whether or not 
present practices are discriminatory. 

The seminal case in this area of law is the 
Fifth Circuit decision in Debra P. v. Turlington . 10 
The case arose after passage of Florida’s 
Educational Accountability Act of 1976, in 
which the legislature established new require- 
ments for graduation from Florida’s public 



5. Harvard Civil Rights Project, The Development and 
Impact of High-stakes Testing (1999). 

6. Kenneth J. Cooper, Standardized Exam Faces Test in 
Texas , The Washington Post, Sept. 22, 1999, at 
A19, (referencing G.I. Forum v. Texas Education 
Agency prior to the January 7, 2000 decision). 

7. Id. 

8. G.I. Forum v. Texas Education Agency, 87 F. 
Supp.2d 667 (W.D. Tex. 2000). 

9. Cooper, supra , n. 6. 

10. Debra P., 644 F.2d 397. 



Student Testing and Assessment 



3-3 



schools. Among other things, the Act (as 
amended in 1978) required students to pass a 
functional literacy test in order to receive a 
high school diploma. Functional literacy was 
defined as “the ability to apply basic skills in 
reading, writing and arithmetic.” 11 

On the first administration of the test, 

78% of black students failed one or more 
sections, while only 25% of white students 
failed some portion. 12 Minority plaintiffs filed 
a class action suit challenging Florida’s exam 
on several constitutional and statutory theo- 
ries including the Equal Protection and Due 
Process Clauses of the Fourteenth Amend- 
ment, Title VI of the Civil Rights Act of 1964, 
and the Equal Educational Opportunities 
Act. 1 3 

In ruling on the discrimination claim, the 
Fifth Circuit agreed with the trial court that 
the evidence was insufficient to support a 
holding of present intent to discriminate. 14 
The Fifth Circuit upheld the trial court’s 
finding that past purposeful discrimination 
had affected the plaintiffs, and that the 
diploma sanction perpetuated that discrimina- 
tion. 15 The trial court found that until 1967, 
Florida operated a dual school system, segre- 
gating intentionally by race. The segregation 
persisted until 1971, and predominantly black 
schools were substantially inferior in terms of 
facilities, course offerings, materials and 
equipment. 16 The court required that the 
government, in attempting to justify the 
discriminatory impact of the test, show “either 
that the disproportionate failure of blacks was 
not due to the present effects of past dis- 
crimination or, that. . .the diploma sanction 
was necessary to remedy those effects.” 17 

The Fifth Circuit held that the trial court 
was correct in finding the government did 

11. Id. at 401. 

12. Id. 

13. Id. 

14. See Personnel Administrator v. Feeney, 442 U.S. 

256 (1979). 

15. Id. at 407. 

16. Id. 

17. Id. (applying the rule used in McNeal v. Tate 
County Sch. Dist, 508 F.2d 1017 (5th Cir. 1975)). 



not meet its burden and that the test could 
not be used as a requirement for graduation 
until all graduating seniors had completed all 
of their public education in a unitary system. 18 
The court upheld the trial court’s ruling that 
the test not be used for four years, except for 
assessment and remedial purposes, and re- 
manded the case to the trial court to consider 
the impact of past discrimination on the 
ability of the students to pass the test. 19 

Four years after the former Fifth Circuit 
ruled in Debra P. y the Eleventh Circuit held that 
the district court on remand correctly found 
that the vestiges of past intentional segregation 
did not cause the test’s disparate impact on 
blacks. 20 Despite evidence that such vestiges still 
existed in Florida schools, 21 the court accepted 
the trial court’s reliance on expert testimony 
that there was no causal link between the 
effects of past discrimination and the dispropor- 
tionate impact of the test on black students. 

The expert testified that other factors such as 
the “educational background of a student’s 
parents, class size, attendance, and amount of 
homework more directly relate to student 
performance.” 22 The court also found that the 
remedial efforts of the school to help failing 
students pass the test further severed the causal 
link between past discrimination and present 
disparate impact. 23 Additionally, the court noted 
that while 57% of students who failed the exam 
in 1983 were black, though blacks constituted 
only 20% of the class, more than 90% of blacks 
ultimately passed the test They found this fact 



18. Id. 

19. Id. 

20. Debra P. v. Turlington, 730 F.2d 1405 (11th Cir. 
1984). 

21. Evidence existed that teachers had racial biases 
which impaired their teaching ability, black 
students were still suspended more often than 
white students, racial stereotypes still persisted and 
blacks were more readily assigned to educable 
mentally retarded classes than whites. Addition- 
ally, though failure rates for blacks had improved 
since earlier administration, of the test, 57% of 
students who failed the test in 1983 were black, 
even though blacks constituted only 20% of the 
class. Id. at 1405. 

22. Id. at 1415. 

23. Id. 
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to be “strong evidence that vestiges of segrega- 
tion do not cause blacks to fail the test.” 24 

Finally, the Eleventh Circuit held that use 
of the test as a diploma sanction would help 
remedy the vestiges of past discrimination. 2 » 
The court relied on an expert who presented 
studies and testified that “the best way to 
encourage student performance is by setting 
objective standards and creating a ‘climate of 
order’ to motivate students.” 26 The court 
expressed a “heavy sense of discomfort over the 
unfairness if discriminatory vestiges. . .have in 
fact caused a student to fail [the test],” but 
decided that the unfairness “would be out- 
weighed by the demonstrated effect of the 
diploma sanction in remedying the greater 
unfairness of functional illiteracy.” 27 

This case and others indicate that courts will 
look to past practices of school districts to 
determine whether they bear a causal relation- 
ship to current disparate effects of testing. 28 
Districts which have long been integrated and 
which have continuously provided equal educa- 
tional opportunity may fare better in the event 
of a legal challenge than those which still have 
a tainted history of discrimination. If testing 
results in a disparate impact on minorities, 
schools that have such histories will be required 
to show that the disproportionate test results are 
not the present effects of past discrimination, or 
that use of the test and its accompanying 
sanctions is necessary to remedy those effects. 

Title VI Disparate Impact 

Even schools with no recent history of 
discrimination face challenges on the basis of 
disparate impact. Tide VI prohibits federally 
funded programs from implementing policies 
that have a disparate impact on minorities. 29 
While the United States Supreme Court has 

24. Id. 

25. Id. at 1416 

26. Id. 

27. Id. 

28. See also Anderson v. Banks, 520 F. Supp. 472 (S.D. 
Ga. 1981); Bester v. Tuscaloosa City Bd. of Educ., 

722 F.2d 1514 (11th Cir. 1984). 

29. 34 C.F.R. § 100.3(b)(2). 



limited Tide VI itself to constitutional param- 
eters requiring a showing of intent to discrimi- 
nate in order to prove a violation, 30 some courts 
allow individuals to sue on a disparate impact 
theory under the regulations implementing Tide 
VI. 31 In this context, differences in student 
performance based on race or national origin 
do not necessarily constitute an impermissible 
disparate impact. Rather, the disparities must be 
statistically significant, and must be present after 
control for other relevant variables such as 
family income. 

As previously mentioned, a United States 
district court in Texas recendy considered 
whether the Texas Assessment of Academic Skills 
(TAAS) examination had an impermissible 
adverse impact on that state’s minority stu- 
dents. 32 The court ruled that although the 
TAAS test did adversely affect minority students, 
the Texas Education Agency (TEA) demon- 
strated an educational necessity for the test, and 
the plaintiffs did not identify an equally effective 
alternative as required. 33 Thus, the court re- 
fused the plaintiffs’ request for an injunction 
preventing the TEA from using failure on the 
exit-level TAAS test as a basis for denying high 
school diplomas. 34 The analysis employed by the 
court should prove useful for other states as they 
consider whether their own high-stakes tests have 
an unlawful discriminatory impact. 

Texas’s efforts to impose accountability on its 
public school system began in 1984 when the 
legislature passed the Equal Educational Oppor- 
tunity Act. In 1987, Texas implemented its first 
high school graduation exit test. This exam was 
superseded in 1990 by the TAAS. Texas stu- 
dents in public schools must pass the TAAS exit- 
level exam before they can graduate high school 
and receive their diplomas. Texas public school 
students begin taking the TAAS test in third 
grade. The “exit-level” TAAS exam, the one 
which students must pass in order to graduate, 

30. See United States v. Fordice, 505 U.S. 717, 722 n. 

7 (1992). 

31. See , e.g . , G.I. Forum v. Texas Educ. Agency, 87 F. 

Supp. 2d 667 (W.D. Tex. 2000). 

32. G.I. Forum, 87 F. Supp. 2d 667. 

33. Id. 

34. Id. 
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is first administered in tenth grade. Students 
who do not pass on the first attempt are given 
at least seven additional opportunities to take 
and pass the TAAS exam before their scheduled 
graduation date. 

Nine minority Texas students who did not 
pass the TAAS exit-level exam prior to their 
graduation dates filed suit individually, demand- 
ing that their respective school districts issue 
their diplomas. 35 In assessing the plaintiffs’ 

Title VI claim, the court applied the burden- 
shifting analysis established by cases brought 
under Title VII of the Civil Rights Act of 1964. 36 
Under Title VII, the plaintiff makes a prima facie 
case of disparate impact by demonstrating that 
a facially neutral practice has caused a dispro- 
portionate adverse effect on a protected class. 37 
According to the court, if the plaintiff makes 
this showing, then the burden of production 
shifts to the defendant 38 The defendant must 
then produce evidence that the practice is 
justified by educational necessity. 39 If the 
defendant meets its burden, then the plaintiff 
may still prevail by showing that an equally 
effective alternative practice could result in less 
racial disproportionality while still serving the 
articulated need. 40 

In this case, the court found that plaintiffs 
made a prima facie showing of significant 
adverse impact. 41 In reaching this determina- 
tion, the court considered both the disparate 
impact on first-time administration of the 
exam and the cumulative pass rates of stu- 
dents after educational remediation. 42 The 
court noted that after educational 
remediation, the scores of minority students 
showed dramatic improvement. 43 Neverthe- 



35. Id. at 668, n. 1. 

36. Id. at 667. 

37. Id. (citing Wards Cove Packing Co. v. Atonio, 490 
U.S. 642, 656-57 (1989)). 

38. Id. 

39. Id. 

40. Id. (citing Watson v. Fort Worth Bank and Trust, 
487 U.S. 977, 986-87 (1988)). 

41. Id. at 678-79. 

42. Id. 

43. Id. 







less, the court determined that in all cases, 
whether after single or cumulative administra- 
tions of the exam, there were still significant 
statistical differences. 44 

After deciding that plaintiffs made a prima 
facie case of significant adverse impact, the 
court determined that the TEA met its burden 
of production on the question of whether the 
TAAS test was an educational necessity. 45 “[A]n 
educational necessity exists where the chal- 
lenged practice serves the legitimate educational 
goals of the institution.” 46 According to the 
court, the TEA merely needed to produce 
evidence that there was a manifest relationship 
between the TAAS test and a legitimate educa- 
tional goal. 47 The articulated goals of the 
TAAS tests were “to hold schools, students, 
and teachers accountable for education and to 
ensure that all Texas students receive the 
same, adequate learning opportunities.” 48 

Three arguments persuaded the court that 
the TAAS test bore a manifest relationship to 
a legitimate educational goal. First, the court 
believed that the TAAS test effectively and 
objectively measured whether students had 
mastered a discrete set of skills and knowl- 
edge. 49 Second, the court concluded that a 
passing standard served the state’s legitimate 
interest in requiring students to master 70 
percent of the tested minimal essentials prior 
to graduation. 50 Third, the court found that 
use of high-stakes tests as a “graduation re- 
quirement guarantees that students will be 
motivated to learn the curriculum tested.” 51 
Finally, the court considered whether the 
plaintiffs showed that equally effective alterna- 
tives existed. The plaintiffs offered evidence 
of alternative approaches including a sliding- 
scale system that would allow a student’s low 
test performance to be compensated by high 



44. 


Id. 


at 


679. 


45. 


Id. 


at 


679-81. 


46. 


Id. 


at 


679. 


47. 


Id. 






48. 


Id. 






49. 


Id. 


at 


680. 


50. 


Id. 






51. 


Id. 


at 


681. 
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academic grades and vice versa. However, the 
plaintiffs did not persuade the court that these 
alternatives could sufficiently motivate students 
to perform to their highest ability. As a result, 
the court ruled that plaintiffs “produced no 
alternative that adequately addressed the goal 
of systemic accountability. ”52 After going on to 
consider due process concerns, the court 
ultimately rejected all of plaintiffs’ claims and 
decided that the TAAS exit-level exam did not 
violate Title VI implementing regulations. 53 

The court’s analysis in this case highlights 
several important considerations for other 
school districts who employ high-stakes testing. 
First, effective remedial programs to support 
students who initially fail are essential. How- 
ever, even if remedial programs wholly elimi- 
nate the disparate effects of high-stakes testing 
prior to graduation, a court may still consider 
the disparate impact of the initial test. Sec- 
ond, the goals of any high-stakes testing re- 
gime should be well articulated in the legisla- 
tion or in the implementing regulations. In 
the face of challenge, the manifest relationship 
between the high-stakes testing and the articu- 
lated educational goals will be critical. Third, 
schools would do well to ensure that their 
tests are designed not only to effectively and 

52. Id. at 682. The need for specific articulated goals 
is critical during this final step of the analysis. At 
least one state uses a system assessment which 
allows accountability for teachers, school systems 
and individual schools, but not for individual 
students. “Value-added assessment,” as the method 
is termed, requires testing each student at each 
grade level. Individual student information is then 
aggregated and run through a complex statistical 
model which adjusts for complexities such as 
students who miss tests, students who move in and 
out of a school system, and students who skip 
grades. The data is then analyzed for a wide 
variety of purposes. For example, estimates of 
teacher effectiveness, the effect of race and 
socioeconomic factors on academic gains, and the 
effectiveness of particular school systems are a few 
of the findings the model can generate. <http:// 
www.aasa.org/SA/dec9801 .htm> 

Thus, if a school system’s goal was expressed 
simply as a desire for “general accountability,” 
potential plaintiffs could argue that an alternative 
method such as the value-added assessment could 
provide the desired accountability without impos- 
ing high-stakes on individual students. 

53. 87 F. Supp. 2d at 683-84. 



objectively measure student skills and knowl- 
edge, but also to compel progress in schools. 
Fourth, while this court sanctioned the use of 
Texas’s cut-score, schools should be prepared 
to defend the rationale for choosing a particu- 
lar cut-score. 

The court’s analysis in this case also leaves 
unanswered questions. The court modeled its 
disparate impact analysis after the test articu- 
lated in Wards Cove. 54 The second step of the 
Wards Cove test shifts only the burden of 
production to the defendant to produce evi- 
dence that the practice is justified by educa- 
tional necessity. 55 I n contrast, the Civil Rights 
Act of 1991 purports to overrule this portion 
of Wards Cove 55 Section 703 (k) (1) (A) (i) 
requires that during the second step, the 
defendant bear not only the burden of pro- 
duction, but also the burden of persuasion . 5 ? If 
other courts follow the text of Tide VII and 
case law subsequent to the Civil Rights Act of 
1991, then school districts will bear a heavier 
burden. Once a plaintiff establishes a prima 
facie case of disparate impact, the school 
district would then need to produce evidence 
that high-stakes testing is justified by educa- 
donal necessity, and persuade the court as to 
the weight of the evidence. Only if the school 
district successfully met that burden would the 
court require the plaintiff to prove equally 
effecdve alternatives. Because other courts can 
jusdfiably employ this requirement, schools 
should be prepared to meet the heavier bur- 
den. 

Due Process 

Strongly linked to the issue of discrimina- 
don is the nodon of procedural due process. 
The Fourteenth Amendment prohibits states 
from depriving cidzens of life, liberty and 
property without due process of law. Property 
rights are legitimate expectations of entitie- 
ment created through state law, regulations or 

54. Wards Cove Packing Co. v. Atonio, 490 U.S. 642 

(1989). 

55. Id. at 656-57. 

56. 42 U.S.C. § 2000e-2(k) (1) (A) (i). 

57. Id. 
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contracts, and play a direct role in determin- 
ing whether a student has a right to due 
process under the law. 

The Supreme Court has held that students 
have acquired a legitimate entidement to 
education as a property interest. 58 Extending 
that rule, the court in Debra P. held that “in 
establishing a system of free public education 
and in making school attendance mandatory, 
the state has created an expectation in the 
students. . .that if a student attends school. . 
.and takes and passes the required courses, he 
will receive a diploma.” 59 It found that the 
right to a diploma is an implied property right 
“as the term is used constitutionally.” 60 By 
allowing students a property right in a high 
school diploma (and arguably in the accompa- 
nying graduation ceremony), the court se- 
cured a benefit to students that may not be 
infringed without due process of law. 

It is the right to due process that has 
received the most attention by courts dealing 
with high-stakes testing. In Debra P. the court 
held that the implementation schedule for the 
exit exam, which was a new statutory require- 
ment for graduation, violated due process of 
law by depriving students of their property 
right to a diploma without adequate notice. 61 
The Florida Act at issue was amended in 1978 
to require passage of an exit exam for receipt 
of a diploma. The first students required to 
take the test were members of the graduating 
class of 1979. The court noted that such an 
“eleventh hour change,” with virtually no 
warning or time to prepare for the test, was 
too severe a requirement. 62 The court upheld 
the injunction against using the test for four 
years, and allowed its immediate use for 
purposes of assessment and remediation only. 

Texas has required students to pass a 
statewide competency exam in order to receive 
a high school diploma since 1985. However, 
in 1991 it began to require the exam as a 

58. Goss v. Lopez, 419 U.S. 565, 574 (1975). 

59. Debra P., 644 F.2d at 404. 

60. Id. 

61. Id. 

62. Id. 
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prerequisite to attending graduation ceremo- 
nies. The test is called the Texas Assessment 
of Academic Skills (TAAS), and it measures 
student performance in math and writing. In 
Crump v. Gilmer Independent School District , a 
1992 Eastern District of Texas opinion, the 
court held that two students who had com- 
pleted all requirements for graduation, but 
who had failed the TAAS exam by two points, 
were entitled to a preliminary injunction 
requiring the school to allow them to partici- 
pate in graduation exercises. 63 The court 
addressed the issue of notice and applied the 
rule from Debra P. that due process requires 
students be given adequate notice that passing 
the test is a prerequisite to graduation. 64 The 
court reasoned that notice is necessary to give 
students adequate opportunity to prepare for 
the test, to give the school district adequate 
time to prepare a remedial program, and to 
have sufficient time to correct deficiencies in 
the test and set an appropriate passing 
score. 65 

Despite the fact that Texas had previously 
required passage of the TAAS exam for 
receipt of a diploma, the court relied on the 
fact that it was only made a requirement for 
graduation in the fall of 1991. The students 
were required to take the test in the spring of 
1992, some 18 months later. The court noted 
other cases in which implementation periods 
of 13 and 18 months were held to be uncon- 
stitutional, and determined that it was likely 
the implementation period would be found to 
be constitutionally deficient. 66 In granting the 
injunction ordering the school to allow the 
students to participate in the graduation 
ceremony, the court required the plaintiffs to 
sign affidavits swearing they would continue to 
take the TAAS exam until they passed. 67 

Interestingly, in the same year in the 
Western District of Texas, a student who failed 



63. Crump v. Gilmer Indep. Sch. Dist., 797 F. Supp. 
552, 554 (E.D. Tex. 1992). 

64. Id. 

65. Id. 

66. Id. at 555. 

67. Id. 
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the TAAS exam by one point was denied a 
temporary restraining order that would have 
allowed him to participate in his graduation 
ceremony. 68 Contrary to the holding in 
Crump, the court in Williams v. Austin Inde- 
pendent School District found that the student 
actually had seven years notice between the 
implementation of the test and the time he 
actually took the test.® In Crump, the court 
measured the implementation period from 
the date in 1991 when the rule was an- 
nounced that the TAAS exam would now be 
required for graduation, to the 1992-examina- 
tion date. In contrast, the Williams court 
measured the implementation period from 
the original enactment of the Act requiring 
the TAAS for receipt of a diploma, which was 
in 1985. It reasoned that the student had 
known for seven years that he had to pass 
the TAAS exam for receipt of a diploma, 
and impliedly correlated receipt of the 
diploma with graduation. 70 

In another interesting note the court in 
Williams stressed that the student who was 
denied the right to attend graduation cer- 
emonies suffered no irreparable injury. It 
reasoned that “walking across the stage 
certainly does not rise to the level of a 
constitutionally protected property interest 
any more than attending one’s high school 
prom. . ,” 71 Furthermore, it reasoned that 
upon completion of the TAAS exam, the 
student could attend a graduation at a later 
date. In comparison, the Crump court specifi- 
cally found that plaintiffs would “suffer irrepa- 
rable harm if they [were] denied the opportu- 
nity to participate in their graduation cer- 
emony.” 7 - That court reasoned that though 
the school has a strong interest in “instilling 
pride in accomplishment by giving students a 
strong incentive to complete high school 
successfully,” these “marginal benefits. . .are 

68. Williams v. Austin Indep. Sch. Dist., 796 F. Supp. 

251 (W.D. Tex. 1992). 

69. Id. at 253. 

70. Id. 

71. Id. at 254. 

72. Crump, 797 F. Supp. at 554. 
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outweighed by the tremendous potential for 
irreparable harm to [the students].” 78 

The judge in Williams acknowledged the 
disparity in the decisions of the two courts 
and stated his disagreement with the Crump 
holding. “Any state interference in the [edu- 
cational] process is simply destructive to the 
attempts by the state to salvage its educational 
system, and this includes interference by the 
federal judiciary.” 7 -* 

The tensions in the case law illuminate 
that this area of law is not well settled. It is 
clear, however, that the analysis will be in- 
tensely fact-based. Schools should err on the 
side of caution, and those school systems 
implementing high-stakes testing should allow 
adequate time for students to prepare for the 
exam. They also must allow sufficient time to 
prepare and implement a remedial program, 
and to correct any deficiencies in the test. 75 
Attorneys representing school districts should 
be familiar with the testing program, and 
should be prepared to make a thorough and 
competent defense of the program on short 
notice in an injunction hearing. 

Systems in which segregation has only 
recendy been remedied must pay special 
attention to this requirement. They must 
allow sufficient time for the present effects of 
segregation to dissipate, and should engage in 
aggressive remedial efforts where possible. 

The court in Debra P. relied on expert testi- 
mony that in order to meet constitutional 
standards, at least “four to six years should 
intervene between the announcement of the 
objectives and the implementation of the 
diploma sanction.” 76 While this may not be a 
magic number in a court of law, it is a good 
starting point. 

Fundamental Fairness — Test Validity 

Another issue that inevitably implicates 
due process is termed “fundamental fairness.” 



73. Id. 

74. Williams, 796 F. Supp. at 255. 

75. Id. 

76. 474 F. Supp. 244, 267 (M.D. Fla. 1979). 
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This issue hinges on the validity of the mini- 
mum competency test, and is often at the 
heart of litigation surrounding high-stakes 
testing. Can schools adequately prepare 
students for competency tests that carry high- 
stakes? How do they ensure each student has 
an opportunity to learn the material that is 
tested? These are difficult questions, and the 
answers are unclear. 

The issue of fundamental fairness was 
explored in Debra P , where receipt of a high 
school diploma was dependent on obtaining a 
passing score on an exit exam. The court 
held that if the exam covered materials not 
taught in the schools, it was fundamentally 
unfair and a violation of both equal protection 
and due process . 77 In addition, the court held 
that the test must be a “fair test of that which 
was taught .” 78 In remanding the issue for 
further findings of fact, the court applied 
rational basis scrutiny to determine that a 
functional literacy exam bears a rational rela- 
tion to a valid state interest . 79 Hence the use 
of an exam-for-diploma is likely to be upheld 
as long as it is fundamentally fair. The ap- 
peals court, however, vacated the district 
court’s finding of validity, because it found 
insufficient evidence that the Florida schools 
actually taught the materials that were tested . 80 
The court noted that an important component 
of a valid test is curricular validity (also known 
as curriculum-to-test match ). 81 Curricular 
validity has two components. First, the test 
items must adequately correspond to the 
required curriculum in which the students 
should have been instructed before they take 
the test. Second, the test must correspond to 
the material actually taught in the school 
regardless of what should have been taught . 82 
The State Board of Education stipulated that it 
had made no effort to ascertain whether or 
not the performance standards were actually 



77. 


644 F.2d at 397. 


78. 


Id. at 406. 


79. 


Id. 


80. 


Id. 


81. 


Id. 


82. 


See Crump, 797 F. Supp. at 555. 



being taught in Florida public schools . 83 Addi- 
tionally, no formal studies were conducted to 
verify curricular validity, and a teacher testified 
that he did not cover all materials in class . 84 
The court noted the trial court’s statement 
that “the test was probably a good test of what 
the students should know, but not necessarily 
of what they had the opportunity to learn .” 85 
The rule announced in Debra R has be- 
come the barometer of the fundamental 
fairness of high-stakes tests. All other courts 
addressing the constitutionality of such tests 
have relied on the “curricular validity” or 
“instructional validity” test. In the two Texas 
cases mentioned above, the courts arrived at 
inconsistent findings on the issue of instruc- 
tional validity, even though the facts in each 
case should have been nearly identical. In 
Crump the court granted the students a pre- 
liminary injunction barring the school from 
prohibiting them from participating in gradua- 
tion ceremonies because they failed the TAAS 
exam . 86 The court expressed considerable 
doubt that the defendant would be able to 
show instructional validity. It noted that 
teachers and administrators were not allowed 
to view the tests, and therefore they had no 
way to determine whether the exam was based 
on material taught in the classroom . 87 

Under the same facts, the Williams court 
denied a temporary restraining order on behalf 
of a student who wished to attend graduation 
ceremonies, by finding “substantial evidence” 
that the school taught, and the student took, 
courses which adequately prepared him for the 
TAAS exam . 88 While not focusing on specific 
course content, it credited the testimony of a 

83. However the Department of Education noted the 
absence only of formal studies and indicated it 
would be able to prove the test actually covered 
materials taught in the classroom. This was the 
case, because on remand the district court (and the 
Eleventh Circuit on appeal) held the test to be 
instructionally valid. See Debra P. v. Turlington, 730 
F.2d 1405 (11th Cir. 1984). 

84. Debra P.; 644 F.2d at 405. 

85. Id. 

86. Crump, 797 F. Supp. at 556. 

87. Id. 

88. Williams, 796 F. Supp. at 254 
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math teacher stating that the math portion of 
the test covers geometry and algebra, and that 
the student had taken both classes in school. 
The court also noted the student had received 
remedial instruction in math to help him pass 
the test after he had previously failed it. 

Finally, the court acknowledged but gave no 
weight to the fact that teachers and adminis- 
trators are not permitted to view the contents 
of the test. 

The underlying issue in these cases is 
whether students have the opportunity to learn 
the materials that are tested on the exam. As 
Debra P. states, to be fundamentally fair, the 
test must cover only subjects taught in school 
and must be a fair test of that which is taught 
in each subject. Ensuring instructional validity 
is no small challenge. Not only must schools 
ensure test validity for the students who grow 
up in a particular school system and attend 
school everyday, they must address how to 
ensure a curriculum-to-test match for students 
transferring into the system at various times. 
For instance, many states require testing only 
in certain school years. In Virginia, students 
are tested in grades three, five, eight and high 
school. Those schools must find ways to 
ensure students transferring into the system at 
grade seven are equipped to take the test in 
grade eight. This can pose a real challenge 
for any school when test performance is tied 
to consequences, especially when those conse- 
quences include revocation of the school’s 
accreditation. 

The cases demonstrate that whether a test 
is fundamentally fair is a fact-intensive analysis, 
and outcomes can vary widely, even on the 
same set of facts. Schools would do well to 
assess their own tests for curricular validity 
prior to a legal action arising. Schools may 
employ outside agencies to assess whether 
what is tested is actually taught, and should 
document such findings, including changes 
made to the curriculum and the test. While 
this may be an arduous task, a school district 
interested in implementing accountability 
mechanisms such as high-stakes tests must be 
prepared for legal challenges that likely will 
arise. 
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CONCLUSION 

While the paucity of case law in the area of 
high-stakes testing inhibits the ability to make 
general conclusions, the issue appears to impli- 
cate few surprising or novel legal theories. In 
general, deference to the states and to local 
school boards will continue, and the traditional 
legal regimes covering due process and equal 
protection will remain intact. However, it is 
important to note that the consequences of 
high-stakes testing for students and teachers 
extend beyond the mere issues of grade promo- 
tion and graduation. The impact on students 
of being denied a diploma can be enduring, 
affecting access to employment opportunities 
and higher education, while the effect on 
teachers may be equally dramatic. (See D. 
Farmelo, Using Student Test Results to Evaluate 
Educational Professionals and Institutions , infra , for 
discussion of the impact of high-stakes testing 
programs when used to evaluate teachers.) The 
implementation of such high-stakes mechanisms 
means school systems must be prepared to 
defeat potential litigation by preemptively 
ensuring their school reform not only is educa- 
tionally valid, but also legally sound. Appendix 
A provides a practical checklist of the issues 
discussed in this chapter to assist the practitio- 
ner in constructing, implementing, and assessing 
a legally sound high-stakes testing program. 
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Special Considerations 
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The legal issues surrounding high-stakes 
testing are very complicated even if one does 
not factor in the special considerations 
required by students with special needs. It is 
apparent, however, that a large number of 
students within a school district may require 
special considerations in the administration 
of high-stakes testing. These special needs 
may arise as a result of a disability, limited 
English proficiency, or other special status. 
This chapter will discuss the issues arising in 
the testing of students who have these special 
needs. 1 

TESTING OF STUDENTS WITH 
DISABILITIES 

The increased use of high-stakes tests has 
created additional problems in the testing of 
students with disabilities. Tests are routinely 
used to determine whether a student qualifies 
as a student with disabilities. But other testing 
issues must be addressed when determining 
whether students with disabilities should partici- 
pate in high-stakes tests including: what accom- 
modations are required for effective participa- 
tion, whether the high-stakes test would be 
invalidated as a result of these test accommo- 
dations and whether the particular test is an 
appropriate instrument for use with the special 



1. The reader should also follow the status of 

legislation pending in Congress, H.R. 4333- 

Fairness and Accuracy in Student Testing, 
106th Cong., 2d Sess. (2000). That legisla- 

tion, if passed, could affect the opinions in 
this article. 



needs student These various issues are ad- 
dressed in the following discussion. 

Individuals with Disabilities Education Act 

The Individuals with Disabilities Education 
Act, 2 (“IDEA” or “1997 IDEA”) places special 
emphasis in its 1997 reauthorization on 
increased expectations for performance by 
children with disabilities. The 1997 IDEA 
also requires an assessment of the effective- 
ness of the methods used to educate these 
special children. 3 Obviously, testing serves as 
an important tool for judging whether the 
methods used to educate students with dis- 
abilities are successful. 

The emphasis on increased expectations 
and outcomes has also focused more atten- 
tion on educating students with disabilities in 
the regular classroom and on insuring their 
participation in the general curriculum. In 
furtherance of this goal, the 1997 IDEA 
requires that students with disabilities be 
educated with their non-disabled peers and in 
general education classes “[t]o the maximum 
extent appropriate.” 4 Students with disabili- 
ties are to be placed in settings other than 
the regular classroom only when the nature 
or severity of the disability is such that the 
student cannot be educated successfully in 
the regular classroom with the use of supple- 



2. 20 U.S.C. § 1400 et seq. 

3. 20 U.S.C. § 1400(c). 

4. 20 U.S.C. § 1412(a)(5)(A). 



